A Parallel-Model Speech Emotion Recognition Network Based on Feature Clustering

نویسندگان

چکیده

Speech Emotion Recognition (SER) is a common aspect of human-computer interaction and has significant applications in fields such as healthcare, education, elder care. Although researchers have made progress speech emotion feature extraction model identification, they struggled to create an SER system with satisfactory recognition accuracy. To address this issue, we proposed novel algorithm called F-Emotion select features established parallel deep learning recognize different types emotions. We first extracted from calculated the value for each feature. These values were then used determine combination that was optimal recognition. Next, input train test type emotion. Finally, decision fusion applied output results obtain overall result. analyses conducted on two datasets, RAVDESS EMO-DB, accuracy reaching 82.3% 88.8%, respectively. The demonstrate can effectively analyze correspondence between types.The MFCC best describes emotions Neutral, Happiness, Fear Surprise, Mel Angry Sadness.The mechanism improve

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

Speech Recognition Approach Based on Speech Feature Clustering and HMM

The paper presents a Segment-Mean method for reducing the dimension of the speech feature parameters. K-Means function is used to group the speech feature parameters whose dimension has been reduced. And then the speech samples are classified into different clusters according to their features. It proposes a cross-group training algorithm for the speech feature parameters clustering which impro...

متن کامل

the emotion recognition system based on autoregressive model and sequential forward feature selection of electroencephalogram signals

electroencephalogram (eeg) is one of the useful biological signals to distinguish different brain diseases and mental states. in recent years, detecting different emotional states from biological signals has been merged more attention by researchers and several feature extraction methods and classifiers are suggested to recognize emotions from eeg signals. in this research, we introduce an emot...

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

Hidden Markov model-based speech emotion recognition

In this contribution we introduce speech emotion recognition by use of continuous hidden Markov models. Two methods are propagated and compared throughout the paper. Within the first method a global statistics framework of an utterance is classified by Gaussian mixture models using derived features of the raw pitch and energy contour of the speech signal. A second method introduces increased te...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3294274